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ABSTRACT 

It has been long recognised that, besides being a formidable experiment to observe the pri- 
mordial CMB anisotropies, Planck will also have the capability to detect galaxy clusters via 
their SZ imprint. In this paper constraints on cosmological parameters derivable from the 
Planck cluster candidate sample are examined for the first time as a function of cluster sample 
selection and purity obtained from realistic simulations of the microwave sky at the Planck ob- 
serving frequency bands, observation process modelling and a cluster extraction pipeline. In 
particular, we employ a multi-frequency matched filtering (MFMF) method to recover clusters 
from mock simulations of Planck observations. Obtainable cosmological constraints under re- 
alistic assumptions of priors and knowledge about cluster redshifts are discussed. Just relying 
on cluster redshift abundances without making use of recovered cluster fluxes, it is shown that 
from the Planck cluster catalogue cosmological constraints comparable to the ones derived 
from recent primordial CMB power spectrum measurements can be achieved. For example, 
for a concordance ACDM model and a redshift binning of Az = 0.1, the la uncertainties on 
the values of £2 m and as are Ai2„, w 0.031 and Aog ~ 0.014 respectively. Furthermore, we find 
that the constraint of the matter density depends strongly on the prior which can be imposed 
on the Hubble parameter by other observational means. 

Key words: cosmology: large-scale structure of the Universe - cosmology: cosmic mi- 
crowave background - cosmology: theory - methods: data analysis - methods: statistical - 
space vehicles: Planck. 



1 INTRODUCTION 

The c osmological potential of gala x y cluster surveys via the SZ ef - 
fect dSunvaev & Zeldovicr] dl97Ch : fSunvaev & Zel'dovichl d 19721) 



andlSunvaev & Zeldovichld 19801): recent revie ws: lRephaelildl99'5T) ; 
Birkinshaw ( 1999) and lCarlstrom et iD|2002)) has been advocated 
by many theore ti cal papers in rece n t years ( see e.g. Bartlettl 
Bartlettf j20oTh; ICohn & Kadotal d2005l) : iHaiman et alT 



2000); 



2001); 



Molnar et alJd2004h etc.). Future measurements of the number den- 
sity and distribution of clusters will have a profound impact on our 
unders t anding of the nature of the Universe (see e. g. IBahcall et al.l 
dl999h : lBofrringer & Schueckeil d2003h : l\^it1 d2005l) ). The SZ effect 
is due to its redshift independence especially valuable for detecting 
clusters. Apart from SZ dedicated surveys (AClQ; AMQ; Amibcfj; 
APEX-S^l SPT0; SZ/*@) which observe fractions of the sky, the 
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Planck surveyor satellitfl which is scheduled for launch in 2008, 
will provide detailed full-sky maps at nine different observing fre- 
quencies, ranging from 30 to 857 GHz. It is thus a particular suit- 
able instrument to detect the thermal SZ effect owing to its distinct 
frequency dependence. 

Recently several authors have investigated the properties 
of a Planck SZ cluster sample by applying different object 
detection algorithms to s i mulate d Planck channel ob se rvations 



(see e . g. lAghanim et al.1 d2005l): iDiego et ai] d2002l): iHansenl 



20041) : iHerranz et al 




sgo et 

iKav et al.l d200ll): iPierpaoli et all 
20051) : iSchafer et al.1 ( 120061) : ISchaefer & BartelmanrJ d2006n ; 
IWhitd d2003h ). For ;xample, iDiegoetalJ d2002l) developed a 
Bayesian non-parametric method to detect clusters in Planck data, 
which combines Planck frequency channels in such a way that 
the signal of contaminating components is reduced with respect 
to the cluster SZ one. Clusters are t hen extracted from the r e- 
sulting map by empl oying SExtractor dBertin & Arnoutsl dl99ot) ). 
IHerranz et all d2002l) and ISchaefer & BartelmanrJ d200€ ) imple - 
mented matched and scale adaptive filtering dSanz et al. Id200l|)) 
techniques to recover galaxy clusters and t heir photometry from 
Planck multi-frequency observations. While Herranz et al. ( 2002) 
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work in the Fou rier domain and apply the filte rs to a 12.8 x 12.8 
deg 2 sky patch. [Schaefer & Bartelmani] d2006h work in spherical 
harmonic space and apply the scale adaptive and matched filter- 
ing technique to full -sky Planck s i mul ations using the HEALPIX 
pixelisation scheme l Gorski etHEl d2005h ) to store the data. Also 
ISchulz & WhiteU2003fr use a simple matched filtering algorithm to 
extract clusters from a map resulti ng from a hypothesised combina- 
tion of Planck frequency channels. iPierpaoli et al. I fcOOSl) discusses 
a wavelet based method for component separation designed to re- 
cover non-Gaussian, spatially localized and sparse signals. A com- 
parison of the selection and contaminatio n of wavelet and matc hed 
filtering extracti o n tech niques is given in IVale & White] d2006h . In 
Geisb usch et alj J2005t) (hereafter GKH05), we applied a cluster 
extraction algorithm that combines the Harmonic Space Max imum 
Entropy Method (HSMEM; see also IStolvarovetafl J2002h ) with 
a Peak Finding Flux Integration method to recover galaxy clusters 
from realistic full-sky Planck simulations based on the HEALPIX 
pixelisation scheme. Furthermore, there have been purely theoreti- 
cal efforts based on the cluster mass function to estimate the power 
of the Planck cluste r catalogue to co n strain c osmological model 
param eters (see e.g. iBattve & Welled J2003h : [Majumdar & Mohj 
(2004)). Their redshift mass detection limits rely only on simple 
noise estimates, i.e. the instrumental noise levels, rather than per- 
forming realistic simulations and applying cluster extraction algo- 
rithms. 

Hence, so far there has not been a study that bases its cos- 
mological parameter constraints forecast on selection and contami- 
nation estimates derived directly from realistic simulations and SZ 
cluster extraction algorithms. Therefore, this work attempts for the 
first time to place constraints on the basis of a realistic Planck clus- 
ter detection pipeline. Here we use the popular matched filtering 
technique to assemble a cluster candidate catalogue, from which by 
comparison with the cluster input population the selection and con- 
tamination of the cluster samples are derived. Based on the cluster 
catalogue properties (selection and purity), we examine the con- 
straints which can be placed on cosmological parameters, mainly 
Cl m and Gg . 

The paper is organised as follows. In section [2] we give a 
brief definition of the SZ effect. Besides the general composition 
of (mock) observations at microwave frequencies, the theoretical 
basics and the formalism of the used cluster extraction method are 
described in section [3] Section [4] summarises details of our per- 
formed mock simulations and discusses their ingredients. The im- 
plementation of the cluster detection algorithm is discussed in sec- 
tion|5] Further, in this section properties of the obtained catalogue, 
mainly its completeness and purity, are investigated. Cosmological 
constraints obtainable from the detectable cluster abundance based 
on the efficiency of the extraction method when applied to Planck 
data (from the Planck cluster sample) under different assumptions 
of the fiducial cosmology, knowledge about cluster redshifts and 
the Hubble parameter prior are presented in section [6] Finally, we 
close our discussion and conclude in section [7] 



effect is given by 
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where x = hv/k B T with T = 2.725 K dMafher et all Jl999h ) and 
/() = 2Ic^Tq /h 2 c 2 . The first term in equation Q} is the so called 
thermal SZ effect due to the thermal motion of electrons of the 
intra-cluster gas. The thermal SZ effect has a spectral shape given 
by 
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and a frequency independent magnitude, the Comptonization pa- 
rameter, 
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In hot clusters (T e > 5keV) the relativistic electrons present 
slightly modify the spectr al shape of the thermal SZ effect 
( Challi nor & Lasenbvl dl998h ). This resulting relativistic correction 
has not been taken into account in this work, since its effect on 
the results presente d is negligible. The detecta bility of the effect 
from thermal (e.g. Pointecouteau et al ] dl998h ) and non-thermal 
(e.g. lEnsslin & Hansen! <2004D ) relativistic electrons has been es- 
timated elsewhere. It is still a matter of debate if Planck will be 
able to detect relativistic SZ contributions. The spectral shape of 
the second contribution in equation (TJ, the kinematic SZ effect, is 
given by 



h(x) 
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(4) 



and its magnitude, (3 = v pec /c, depends on the uniform peculiar 
line-of-sight bulk motion of the cluster's electron plasma, v pec . 



1 = err 



dh 



(5) 



is the Thomson optical depth. In the case the cluster can be assumed 
to be isothermal the Comptonization parameter can be expressed by 



m e c 2 



(6) 



In this paper we concentrate on the thermal SZ effect and treat the 
kinematic merely as a contaminant to the thermal SZ. 



3 CLUSTER DETECTION METHOD 

Before describing the method utilised, a brief schematic overview 
of the nature of SZ cluster survey observations and known con- 
tributing CMB components is given. Based on these considerations 
an assessment can be made of the requirements that separation 
techniques have to satisfy. More detailed discussions about which 
components are of importance to microwave observations at the 
Planck observing frequencies are presented in section|4] 



2 THE SUNYAEV-ZEL'DOVICH EFFECT IN BRIEF 

The anisotropy in the microwave band caused by the SZ effect can 
be separated into two contributions which are distinguished by the 
origin of energy of the scattering electrons that is responsible for 
the shift of photon frequency. The total distortion due to the SZ 



3.1 Mock observations 

The various components contributing to observations at the Planck 
frequencies have either different physical processes and/or differ- 
ent sources as origin. Their contributions therefore vary with fre- 
quency and scale. The SZ effect - the component of interest - as 
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described in the previous section is due to inverse Compton scat- 
tering of CMB photons off electrons inside galaxy clusters, which 
are localised extended objects. In microwave observations the av- 
erage cluster appears as a source with an extension of the order of a 
few arcminutes ignoring instrument dependent beam convolution. 
Several other components of different nature contribute to SZ ob- 
servations as a background or foreground. Here, we briefly mention 
the most important ones. First of all, there is the primordial CMB 
component. According to standard inflationary theories, which are 
in good agreement with constraints placed by recent observations, 
this component is a homogeneous random Gaussian field entirely 
described by its power spectrum. Cumulatively, field point sources 
contribute in an isotropic manner to SZ observations in the radio 
and far-infra-red wavelength regime. Furthermore, in the Galactic 
plane dust, free-free and synchrotron emission from the Milky Way, 
our Galaxy, also represent within certain wavelength regimes an 
important source of confusion. The components mentioned so far 
are all of cosmological or astrophysical nature. The spatial reso- 
lution of the observations is limited by the instrument design and 
the resulting instrumental beam. A component of a different kind, 
which unavoidably corrupts the observed data is the instrumental 
noise. Hence, generally, a SZ observation at a single frequency v 
can be modelled by: 

d v ( x ) = Y, *v i ( x ) + «v ( x ) i (7) 
i 

where s v /( x ) is the contribution at position x of the ith cluster and 
n v (x) gives the cumulative 'noise' contribution (including all other 
components) to the data d v at x. For a single frequency survey d v is 
a scalar field. In the following, s V j(x) refers to the thermal SZ con- 
tribution of the cluster, whereas other SZ components, if present are 
regarded as noise. Even though point sources are localised objects, 
in the following their collective contribution is regarded as a single 
diffuse noise component. 

Building on equation ((7} a multi-frequency observation is de- 
scribed by: 

d(x)=£ S/ (x)+n(x), (8) 

i 

where d' = (d Vl ,d Vl , ...,d v J, s' = (s Vl i,Sv 2 i,...,Sv n i) and n' = 
(«vi , M v 2 ) ■•■>%„) are transposed column vectors of the data, the ith 
cluster SZ signal and the noise. Their components are the particu- 
lar values and contributions respectively at observing frequencies 
Vi,V 2 ,...,V„. 



3.2 Matched filtering 

This section discusses the matched filtering technique, which 
utilises spatial as well as spectral information to detect SZ decre- 
ments (increments) of galaxy clusters. The l i teratur e also refers to it 
as optimal filter (e.g. Haehnelt & Tegmarkl Jl996l) ). In which way 
the matched filter is an optimal one is discussed later in this sec- 
tion. The matched filter is a template cluster extraction method. A 
common cluster temp late is, for example, the sphe rically symmet- 
ric P-profile (see e.g. ICavaliere & Fusc o-Femianol il976l) ). In this 
case the observed SZ signal for the ith cluster at position x = at 
frequency V; is given by: 

M x ) = /Bv / (x-x')./vA Vrcf/ T(x')rf 2 x' 

= jBvjix-^fv^ill + W/rctfihlW, (9) 



where A Vrd ,■ is the amplitude of the ith cluster at the reference fre- 
quency v le f, r c ; determines the spatial cluster scale (cluster core 
radius), By. is the instrumental beam at frequency Vj and fy j the 
frequency conversion factor (fy ta = 1). T represents the spatial tem- 
plate normalized to unit amplitude. 

In constructing the matched filter for a given profile, the 
noise «v,( x ) is assumed to be homogeneous with average value 
{n Vj (x)) =0 and cross-power spectrum P VjVj (k) defined by: 

(n Vi (kK.(kO) = /V,(k)8 D (k'-k), (10) 

where « v , (k) is the Fourier transform of the noise n Vl (x), rejj.(k') 
denotes its complex conjungate and 8d is the Dirac delta function. 
The homogeneity of the noise ensures that its statistical properties 
are independent of position. Cosmic backgrounds such as the pri- 
mordial CMB and point sources, as well as the instrumental noise, 
meet this requirement of statistical homogeneity. Globally, Galactic 
components, such as Galactic dust emission, are not homogeneous. 
However, on the scale of clusters (~ several arcmin 2 ) homogeneity 
of these components is a reasonable assumption. Our approach to 
estimating the background noise cross-power spectrum is discussed 
in section [4] 

Moreover, Galactic and point source contributions are caused 
by emission processes. Therefore, they (always) cause physically 
an increment in the observed temperature. This violates the require- 
ment of zero mean and leads to a biasing of the recovered signal 
amplitude of the ith cluster, A Vrt ,-. Assuming a central limit and 
taking off the zero Fourier modqj, the map is modelled in simula- 
tions to have a zero mean. This is a fairly safe modelling approach 
of CMB sky observations since the monopole on the targeted sky 
patch is usually unobserved by common instrumental designs. The 
required (n Vj (x)) = can thus be satisfied. The fact that the zero 
Fourier mode (monopole) is unobserved has a negligible effect on 
measured cluster fluxes. 

Given the assumptions of zero mean and spatial homogeneity 
the optimal matched filter is then derived as follows. If the sky is 
observed at n f frequencies, the most general linear estimator of the 
amplitude A Vrcr , (x) of cluster i at position x is given by: 

A^,.(x) = |>P(x-x')-d(x')d 2 x' 

= £(/ i M*-*)«M*)d 2 *'). (ii) 

where *P and d are column vectors each containing nj components. 
The components of l P' = (\|/ Vl , \|/ v , , . . . , \|/ V; ^ ) are frequency depen- 
dent weight functions. Ay St fJ (x) represents the estimate of the clus- 
ter SZ signal amplitude. This convolution (equationl 1 1 \ can be writ- 
ten in Fourier space conveniently as a product: 

= £ (/ *j (k)Vv, (kK ik "d 2 k) , (12) 

where dv and *P V denote the Fourier transform of the data and 
filter at frequency Vj. 

In the case of the matched filter one requires the weight func- 
tion vector f (the filter) to satisfy the following criteria: 

(i) The quantity A^' f is an unbiased estimator of the SZ ampli- 
tude of the cluster. Thus (A^ f ) = A Vtef is required. 



Here it is assumed that the emission components' pixel temperature val- 
ues scatter roughly symmetrically about their mean. 
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(ii) The variance of the noise of the estimator, o 2 , is minimized 
by the filter, which ensures that is an efficient estimator. 

The requirement of being an unbiased estimator fixes the nor- 
malisation of the filter by: 



d z x-A v 



0, 



(13) 



where the brakets {) denote an average estimate for a specified 
spatial cluster template centred at the origin (x = 0) obtained over 
many noise realisations. The filter shape is determined by demand- 
ing the variance of the estimate, 



O 2 = <«,-) 2 >-« r } 2 , 



(14) 



to be minimal. This minimization of o" 2 ensures that the shape of 
the filter is optimally chosen to be maximally sensitive to modes 
at which the cluster signal exceeds the noise. The multi-frequency 
filter satisfying these c onditions is given in Fourier s p ace by the ma- 
trix eq uation (see also Haehnelt & Tegmark dl996l) ; lllerranz et al.l 
J2002|) )FI 



*(k) = aP _1 (k)F(k), 

a = (J F'(k)p-'(k)F(k)d 2 



(15) 



= a 2 , (16) 

where F is a column vector described by F' = 
(/ Vl B Vl T,/v 2 B v ,t, ...,/ Vn 6 Vn t) and P 1 is the inverse of the 
noise cross power spectrum matrix with components P VjV . (k). 
Equation [16] gives the so-called multi-frequency matched filter 
(MFMF), which is an e xtension of the single-freq uency matched 
filter (SFMF) derived in lHaehnelt & Tegmark! dl996h . 



4 MICROWAVE SKY SIMULATIONS 

In succeeding sections, extraction algorithms are applied to simu- 
lated Planck observations of 5 x 5deg 2 sky patches. In particular, 
data is simulated for the Planck High Frequency Instrument (HFI) 
channels at 100 GHz, 143 GHz, 217 GHz, 353 GHz, 545 GHz and 
857 GHz. Due to their resolution and/or the ratio of the SZ sig- 
nal in comparison to amplitudes of other fore-/backgrounds within 
the channel bandwidths, three of these HFI frequency channels - 
namely the 100 GHz, 143 GHz and 353 GHz channel - are the 
most useful ones of Planck for galaxy cluster detection via the SZ 
effect^ Simulated sample patches outside and within the Galac- 
tic plane observed at the frequencies of these three channels are 

9 Due to the fact that clusters have on average an angular scale of a few 
arcminutes, the flat sky approximation and therefore working in Fourier 
instead of spherical harmonic space is an adequate approximation. 

10 Note that Planck channels which have not been taken into account in 
this analysis provide extra information about the SZ signal and contami- 
nants. Adding these channels in the analysis hence causes a slight increase 
in the number of detected clusters and decreases the number of false de- 
tections. Trial runs suggest that this affects the total cluster number count 
by less than 10 per cent. The obtained completeness estimate is thus a con- 
servative lower one. We restrict our analysis here to the HFI channels in- 
cluding the three most important frequencies for SZ cluster detections to 
keep computational cost down. For the purpose of estimating cosmological 
constraints derivable from the Planck cluster sample, effects on the cluster 
number count of this order are of minor relevance. Moreover, other effects 
such as the uncertainty of the mass function are of similar order of magni- 
tude. 



shown in Figure Q] to give an impression of the expected qual- 
ity of the Planck mission data. For the purpose of estimating the 
performance of the extraction techniques when applied to Planck 
data, patches residing within the Galactic plane and outside of it 
have been simulated in the right proportion. In the case that ob- 
served patches lie within the Galactic plane, the simulations have 
to include modelling of the Galactic dust, synchrotron and free-free 
emision besides the primordial CMB, the SZ effects, extragalactic 
point sources and instrumental effects. Actually, Galactic dust is 
by far the most dominant Galactic component at the frequencies 
of interest. Realisations of the Galactic components, the primordial 
CMB and the SZ effects have been obtained similar to the ones 
described in GKH05. However, the dust modelling uses this time 
a two temperature model. Moreover, the anisotropic nature of the 
Planck instrumental noise on the sky due to the scan pattern of the 
satellite is taken into account. The extragalactic point source popu- 
lation has been modelled in a differen t way than previously. I nstead 
of utilising the theoretical model of Toff olatti et alj 0998) as in 
GKH05, which has been found to match the WMAP point source 
detections within a factor of two at frequencies below 100 GHz, 
a phenomenological approach has been taken this time to obtain 
number counts of the radio and far infra-red/sub-mm point sources 
at the Planck observin g frequencies. Extrapolations of WMAP data 
( iBennett et all §003)) suggest that the contamination due to ra- 
dio sources is marginal above 100 GHz. While radio point sources 
dominate below 100 GHz, at the Planck channel frequencies of 
interest (v ^ 100 GHz) the confusion caused by dusty luminous 
infra-red sources is most important. The extragalactic far infra- 
red/sub-mm (IR/SM) source count modelling performed here is 
based on the 350 GHz observation s of the Submi l limetr e Common 
User Bolometer Array (SCUBA; iHolland et all d 19991) ) mounted 
on the lames Clerk Maxwell Telescope (until 2003). There have 
been s e veral deep observat io ns made with SCUBA dSmail et al 



1 1997 1 ) ; iBargeretal 



.19981) ; IHolland et al.l dl998l) ; iHughes etal. 
< 19981) : lEales et alj 41999)) from which source counts have been 
obtained. SCUBA blank field counts of the IR/ SM sourc e popu - 
lation on larger fi elds have been obtain ed by IScott et all d2002l) . 
iBorvs et alj d2003l) and lScott et"al] d2006l) . From this work one can 
derive phenomenological fitting formula to the source counts at 350 
GHz. The extrapolation of these source counts to lower frequencies 
(100 $; v < 300 GHz), however, involves substantial uncertainties 
since the spectral behaviour of the sources is not (very) well known 
and may change significantly from one point source to another. Fur- 
thermore, due to a general lack of observations in the frequency 
regime, knowledge of the emission of extragalactic point sources 
at intermediate HFI Planck channels is relatively poor and one has 
to rely on extrapolations based on assumptions about the source 
spectra. In the (rather unlikely) case of precise knowledge of the 
spectral behaviour, one could measure the source flux at higher fre- 
quencies (v 350 GHz) at which these sources dominate and sub- 
tract off appropriate flux levels at the frequencies of interest. Here, 
we take a mean spectral index of a = 2.6 and assume a rms scat- 
ter of C a = 0.3 around this mean for individual galaxies to do the 
spectral extrapolation. We further assume that IR/SM sources rele- 
vant to Planck observations are spatially uncorrelated with clusters 
detectable by Planck. Given that luminous dusty galaxies are ex- 
pected to be at high redshifts (z > 1) this should be a reasonable 
assumption. 



Flat sky patches can be obtained from full sky Planck data by 
stereographic projection of reasonably sized regions of the sphere 
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(pixels of 1.5 arcmin) 



(pixels of 1.5 arcmin) 



4 - GHz Planck charm- 




(pixels of 1.5 arcmin) 



Figure 1. 5 x 5 deg 2 realisations of Planck observations at the three observing frequencies of the satellite which are most important for cluster detection via 
the thermal SZ effect. The upper panels show the observed patches at 100 GHz, the mid panels show the observed patches at 143 GHz and the bottom panels 
show the observed patches at 353 GHz. The left column shows a patch lying outside the Galactic plane and the right column a patch within the Galactic plane. 
The same pixelisation scheme and primordial CMB realisation have been adopted for each channel map. 
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onto planes tangential to the sphere at the centres of the patches^ 
Figure Q] shows simulated observations of example patches at three 
considered Planck channels. Even though the patch shown in the 
left column lies outside the Galactic plane (low dust region) the 
identification of galaxy clusters by eye in the map is impossible. 

This approach of splitting up the sky into patches is in the 
case of optimal filtering the preferred one, since background noise 
levels vary significantly over the full sky. For the matched filter 
method, it is therefore necessary to have some estimate of the lo- 
cal background noise power spectrum. This knowledge has been 
assumed to be available (to some realistic degree). For example, it 
can be gained from the HSMEM separation which has the ability to 
recover several physical components at ones, each of which is spa- 
tially distinct and has a different frequency dependence. However, 
as a zeroth order estimate of the noise power spectrum one may 
also take the power spectrum of the sky patch under consideration 
and then iterate until the extracted cluster number and the power 
spectrum estimate of the noise converge. Explicitly, this is done 
by removing in each iteration step the number of clusters detected 
above a threshold (e.g. above a signal-to-noise threshold of 3a) 
from the map and taking the residual map as the new background 
noise estimate. If background noise estimates are wrong, the detec- 
tion significance (signal-to-noise ratio) returned by the method is 
systematically flawed - hence this is a source of systematic error. 

Before applying a MFMF cluster extraction algorithm to the 
data, one can perform a (pre-)cleaning of the channel maps to re- 
duce the level of certain contaminants and increase the signal-to- 
noise ratio of the SZ signal. Point sources, for exa mple, can be 
remov ed by a Mexican Hat wavelet technique (see e.g. lVielva et al.l 
J200ll) ). Since the 857 GHz channel is completely and the 545 GHz 
channel mostly dominated by Galactic dust emission within the 
plane of our Galaxy after the removal of IR/SM point sources, these 
channels can be used on their own or in combination to remove 
the dust emission of the Milky Way at lower frequencies. Further- 
more, one might lower the level of primordial CMB contamination 
by using the 217 GHz channel a nd performing spatial filtering in 
spherical harmonic/Fourier space. iHerranz et al] J2002h found that 
if all this cleaning is performed the cluster number count obtained 
by a MFMF method above a 3a detection threshold is increased 
by 7 percent and the false detection rate lowered by 12 percent in 
comparison to the results obtained when the MFMF scheme is ap- 
plied directly to the 'raw' data. At higher signal-to-noise detection 
thresholds this gain is expected to be even less (in the following a 
5a detection threshold is used). This suggests that even on its own 
a MFMF method yields robust and reliable results. Thus such a 
(pre-)cleaning step has not been included in our analysis pipeline. 



5 MFMF CLUSTER EXTRACTION 

In the following we apply multi-frequency matched filters to the 
sky observation simulations described above. The filters are con- 
structed according to the instructions given in section [X2l A cluster 
candidate catalogue is compiled by using a cluster extraction algo- 
rithm based on multi-frequency matched filtering consisting of a 
number of steps explained in this section. 



Note that the patch size should not exceed 15 degrees at most to avoid 
significant structural deformations. 



5.1 Extracting thermal SZ cluster signals 

Apart from the initial convolution of the multi-frequency data with 
the diverse filter kernels to generate a multi-dimensional detection 
likelihood space, the extraction algorithm consists of several steps 
which are iteratively repeated until all candidates with detection 
significances above the required thresh old are obtained. The a lgo- 
rithm is similar to the one sua gested bv lSchulz & White! {2003). 

First, the frequency channel maps are convolved with filters 
whose spatial scales are gradually varied. The upper and lower limit 
of the spatial filter scale are determined by the expected range of 
sizes of detectable clusters on the sky. Besides, performing scale 
dilation the shape of the cluster template used to construct the filter 
kernels can also be varied. Since the complexity of several steps of 
the algorithm scales (linearly) with the diversity of the filter kernels, 
for a given patch of fixed pixel resolution the computational cost of 
the algorithm depends strongly on the number of distinct filter ker- 
nels applied. The optimal discretisation of the kernels depends on 
the data at hand. For example, due to the beam sizes of the Planck 
channels which are rather large in comparison to the average cluster 
size, one does not expect to gain (much) valuable information about 
the scales of the unresolved majority of clusters[3 Thus, a rather 
coarse filter scaling should be adequate on scales not resolved by 
the Planck beams. However, as we discuss in section [5"31 ofherwise 
a fine filter scale discretisation is advantageous. In order to con- 
struct a detection likelihood space, the convolved multi-frequency 
channel maps are co-added and normalised to unit variance for each 
filter kernel as described in section [3~!2l 

Subsequently, at each iteration the cluster candidate with the 
highest signal-to-noise ratio is identified in the unit variance nor- 
malised 'detection likelihood space' spanned (in our case) by the 
position parameters and the discretized filter scale. A bright clus- 
ter has a high detection significance at various filtering scales at 
(roughly) the same sky position. Its photometric parameters are 
determined on the basis of the variation of the detection likeli- 
hood (peak height in the normalised likelihood space) with filter 
scale. For the filter scale that is closest to the true scale of the 
cluster, its signal-to-noise ratio should become maximum. The de- 
tection is added to the cluster candidate list and its estimated sig- 
nal is subtracted from the cube. This removal takes place directly 
within the detection likelihood space. It is realised by subtracting 
off an amplitude normalised template at the candidate's sky posi- 
tion. The template consists of the unit peak (most likely) cluster 
candidate shape convolved at each scale by the respective filter ker- 
nel. Since the number of filter scales is limited[3 the template can 
be (pre-)generated and stored for (further) applications of the un- 
altered algorithm. The amplitude to which the template is finally 
normalised corresponds to the most likely central cluster candidate 
SZ distortion in units of the standard deviation of the co-added filter 
kernel map. 

Thereafter, the algorithm is reapplied. The procedure com- 
mencing from the step of locating the most significant detection is 
repeated in a loop on the residual detection likelihood maps until no 
further detection is found above the chosen signal-to-noise thresh- 
old. Thus the extraction algorithm represe nts an iterative approach 
similar to a CLEANing procedure (see e.g. 

HogbqnJ jl974) ; IClarkl 



It is referred to a cluster as being unresolved in the case that the clus- 
ter's entire flux is picked up by one instrumental beam pointing. The cluster 
therefore appears as a point source in the Planck data. 
13 This causes the best scale estimates of the candidates to be discretised 
as well. 



© 0000 RAS, MNRAS 000, 000-000 



Cosmology with the Planck cluster sample 7 



(1980)). This procedure represents a convenient way to disentangle 
cluster-cluster confusion as long as the detection of highest sig- 
nificance coincides with the brightest remaining cluster candidate 
on the patch. This is usually the case. Therefore, by its removal the 
signal-to-noise ratios of the rest of the candidate detections become 
unbiased. However, occasionally due to biasing the signal-to-noise 
ratio of a candidate is overestimated. For example, in the case that 
filtering at various scales indicates that the projected SZ signal of 
several clusters might overlap and the single candidate detection 
significance might be biased due to the overlap or even the detec- 
tion might be a false one, one has to (re-)assure that the detection 
significances are to the largest possible extent unbiassed. This can 
be tested by varying the order of iterative removal of the candidate 
detections under consideration. It is the candidate whose signal- 
to-noise ratio varies the least which should be removed first. Such 
overlaps sometimes occur by chance due to line-of-sight projec- 
tion over a deep light-cone, as well as in supercluster environments 
in which clusters are situated close to each other even in redshift 
space. 

Moreover, splitting up the observed patch into seperate re- 
gions, in such a way that none of the detections in one region 
is affected noticeably by one of the other regions and vice versa, 
speeds up the algorithm slightly by reducing the number of itera- 
tions required until all candidate detections above a specified detec- 
tion threshold are found. The number of iteration^ reduces from 
the number of all candidates to the highest number of candidates 
in one of the seperate regions above the signal to noise threshold. 
However, the speed up is mostly due to the minimisation of space 
in which operations have to be performed. It also provides a way to 
parallelise the extraction algorithm when it is performed on large 
datasets. The angular clustering of galaxy clusters and the largest 
scale covered by the set of filter kernels place a lower limit on the 
patch (region) size. The size has always to be chosen large enough 
so that all clusters affecting each other are located in the same patch 
(region). 

After completion of the algorithm, a cluster candidate cata- 
logue containing the candidate's position on the sky, scale, cen- 
tral amplitude, flux, morphological parameters, such as the (asymp- 
totic) slope of the profile, ellipticity and inclination angle0 (etc.) 
is at hand. 



5.2 Cluster catalogue properties 

After applying the MFMF algorithm to a representive number of 
sky patches of 5 x 5 deg 2 whose noise realisations (instrumental 
noise levels, Galactic foregrounds) are varied according to the pro- 
portion they occupy on the full sky, the properties of the obtained 
cluster candidate catalogue can be evaluated. For this purpose the 
candidates have to be associated with real clusters. 



Here we refer to an iteration as a step that ends with the listing and 
(complete) removal of one or several clusters. 

15 For the sake of minimising computational cost we did not vary morpho- 
logical parameters of the profile to create filter kernels. We rather assumed 
the cluster profile which has been used to simulate the SZ sky to be known. 
However, this makes no difference for unresolved clusters. In the case a 
cluster is resolved adequate 'morphological' variation of the filter kernel 
should in the limit return results as presented here. 




Figure 2. The integrated completeness of the cluster catalogue obtainable 
from the Planck survey using the MFMF algorithm. Besides matching can- 
didates with single clusters, candidate detections have also been allowed 
to have several clusters associated with them due to projection along the 
line-of-sight onto the sky. Single cluster matching is given by the solid line, 
while multi cluster matching is shown by the dashed line in each case. The 
integrated completeness is defined as V(Y C \ ^ F,z 0) = Ni et (Y c \ ^ Y,Z ^ 
0)/Ae X p(F c i Y,z 0), where Nfe t (Y c \ ^ Y,z 0) is the recovered number 
of clusters above a flux Y and N exp (Y c [ F,z ;S 0) is the expected cluster 
number above F over the whole sky and redshift range. 



5.2.1 Matching up candidates with clusters 

Based on the extracted cluster candidate list a matching is per- 
formed between candidates and clusters of the input cluster cat- 
alogue. Similar as in GKH05, a lower matching flux limit of 
Y = 3 x 10~ 4 is adopted to avoid dubious associations occuring 
just by chance. This flux threshold corresponds approximately to 
the analy t ically derived 3g point sou rce sensitivity of Planck (see 
lHaehneltl dl997h ; lBartelmanrj bOOlh ). A candidate is successfully 
matched up with a real cluster if the distance between their cen- 
tral positions does not exceed a predefined matching length. The 
association of candidates with clusters is a crucial point in the eval- 
uation of the performance of the extraction method since it affects 
directly the completeness and contamination estimates. For exam- 
ple, in GKH05 a matching length of just one pixel of ~ 2 arcmin 
was assumed for assigning clusters to candidates recovered from 
simulated Planck data. Such a conservative matching length leads 
to firm lower limits of the completeness and purity of the recov- 
ered candidate sample since every candidate that is not associated 
with a cluster due to the small matching length is considered to 
be a false detection. Henceforth, the extent of the matching length 
is chosen in a way to account for the fact that cluster candidate 
positions derived from Planck data are not highly accurate. Posi- 
tional uncertainties arise mainly due to limited instrumental resolu- 
tion and noise variations on the resolution scale since cluster core 
regions even of resolved clusters commonly fit into the Planck in- 
strumental beams. Therefore, we accept a match if the positional 
deviation (of the pixel centres) does not exceed a matching length 
of v2 x FWHM of the instrumental beam. As the resolution differs 
with channel, the average FWHM of the beams is taken for con- 
structing the matching distance. Moreover, the pixelisation of the 
maps is chosen fine enough to ensure that pixelisation effects do 
not play a (major) role. 

In the case that a candidate can be associated with several clus- 
ters, it is matched up with the cluster that yields the best flux match. 
In the reciprocal case of several candidates being matched to one 
real cluster we proceed in the same way by keeping the best flux 
match. However, in the following we also consider that multiple 
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clusters are associated with one candidate detection and vice versa. 
Due to the angular clustering of sources and the fairly large Planck 
beams, a disentanglement of contributing sources is sometimes im- 
possible and a candidate's flux estimate can correspond to the sum 
of several unresolved sources (of even similar flux). In rare cases re- 
solved low redshift clusters are detected as multiple candidates due 
to noise and background SZ variations on scales smaller than the 
cluster extent. Then the candidate which fits best the photometry of 
the real cluster is assigned to it. All the others are conservatively 
regarded as false detections. Liberally, they are simply disregarded 
in the case they fall within the matching region of the accepted can- 
didate. As already mentioned above, if a candidate is not matched 
up with any real cluster above the flux limit, it is regarded as a false 
detection. 



5.2.2 Catalogue completeness and purity 

Important measures of the quality of a recovered cluster catalogue 
are its completeness and purity. These also set benchmarks for the 
evaluation of how useful the catalogue can be for cosmological pur- 
poses. For example, a low completeness results in large uncertainty 
of the total number of clusters on an observed patch. A catalogue 
becomes useless if besides a low completeness it possesses as well 
a low purity. The few positive detections are then diluted by false 
ones and the catalogue can neither be used to make predictions 
about the cluster ensemble nor serve as a basis for follow-up obser- 
vations to learn more about individual objects. If one does not want 
to rely heavily on observations at other wavelengths (e.g. optical 
and X-ray cluster observations), which are possibly spoiled them- 
selves, the only way to determine these measures, the completeness 
and purity, is to perform realistic simulations whose ingredients, 
such as the underlying cluster sample, are known. 

Figure [2] shows the completeness of the catalogue extracted 
by the MFMF algorithm for candidates detected above a signal-to- 
noise threshold of 5. Above Y ~ 10~ 3 arcmin 2 the vast majority of 
clusters is recovered. At Y ~ 10~ 3 arcmin 2 the sample is still more 
than 50 percent complete. Below this flux regime the completeness 
of the sample falls steeply0The fraction of clusters with a real flux 
Y < 10~ 3 arcmin 2 that is matched up with a candidate of 5o detec- 
tion significance is low. As expected, the completeness is increased 
in the case that multiple clusters are permitted to be assigned to 
one candidate (see dashed line in Figure[2]l. At high cluster fluxes 
this is mainly due to halo clustering in redshift space (superclus- 
ter environment). Due to the low surface density of (massive) high 
flux clusters on the sky, cluster overlap is unlikely to happen just at 
random (the number density of clusters above the chosen matching 
flux limit of Y = 3 x 10~ 4 arcmin 2 is well below one cluster per 
square degree for the assumed cosmological models). Note as well 
that massive clusters are more strongly clustered than low mass 
ones. However, the probability of cluster-cluster confusion to occur 
by chance due to projection along the line-of-sight increases rapidly 
with decreasing flux. The shown completeness estimate gives an 
average as expected for a full-sky survey. Note that spatially the 
completeness varies strongly, depending on the instrumental noise 
and the Galactic foregrounds. 

Further, it turns out that the recovered candidate catalogue is 



16 Note that for visualisation purposes the flux (x-axis) is plotted logarith- 
mically in Figureff] This and the use of the integrated completeness weaken 
the impression of steepness of the curve. The remaining marginal complete- 
ness at low fluxes is entirely due to cluster detections at higher fluxes. 



of high purity. The rate of false detections contaminating the sam- 
ple of candidates with detections of a signal-to-noise ratio of ^ 5 is 
of the order of one percent. Note that due to a limited number of ex- 
tracted cluster candidates there is always a sample variance error on 
the estimated purity as well as on the completeness estimate. How- 
ever, by simulating and analysing an appropriate number of patches 
which ensures a high number of extracted candidates, the error on 
the estimates is minimised and a contamination of the candidate 
sample above ~ 3 percent can be excluded at high significance. On 
the basis of the completeness of this fairly pure recovered cluster 
sample, cosmological parameter constraints are derived in section 
[6] In the case the detection threshold is lowered, as expected, the 
completeness as well as the contamination increase. 

5.3 Some remarks 

Matched filtering is a template-based object detection approach. 
To extract the thermal SZ effect, templates e mpirica l ly de- 
rive d from fits to observations ( e.g. th e P-profile: I Kind i 19721) 
and ICavaliere & Fusco-Femianol 1 1978b) and/or by hydro static 
theoretical considerati ons (e.g. iKomatsu & Seliakl d20020 and 
ICoorav & S heth (2002)) are utilised. The filter spatial scale is var- 
ied by changing the characteristic radius of the cluster template 
(e.g. the core radius of a p-profile). Furthermore, one may also 
parametrise the template shape (e.g. variation of P in the case of the 
P-profile). The universality of the template is essential to guaranty 
a photometrically unbiased cluster candidate sample whose signal- 
to-noise ratio distribution is not systematically skewed. A universal 
template represents thus an average scalable shape of a cluster SZ 
imprint in the CMB. Due to different cluster environments and mor- 
phologies, the imprints caused by single clusters 'scatter' around 
the average one. 

In the case of Planck whose highest resolution of a frequency 
channel map is 5 arcmin (FWHM), exact knowledge of the clus- 
ter SZ template is of less importance than it is the case for high- 
resolution observations (such as AMI observations), since the con- 
volving instrumental beam erases much of the information on clus- 
ter (sub-)structure. Hence, good knowledge of th e beam shape is 
impor tant in the case of Planck. In previous work dSchulz & White] 
( 2003)) concerning a matched filter cluster recovery from Planck 
observations, clusters have been regarded as point sources and the 
beam shape has been used as template. However, even for 'low res- 
olution' Planck data varying template parameters (i.e. building a 
discretised template library) is advantageous in comparison to a 
single filter kernel and leads to a larger number of extracted clus- 
ters. Nevertheless, the finer the discretisation of the template pa- 
rameters for constructing matched filtering kernels is, the higher 
the number of reliably extracted candidates becomes. The compu- 
tation time needed by the algorithm is raised approximately linearly 
with filter kernel variety. At some point, however, the increase in 
consumed computing power outweighs the gain in recovered clus- 
ters. Therfore, there is always a trade-off between computational 
cost and maximising the number of reliably extracted candidates. 
Our implementation of the MFMF is tuned to be most efficient for 
Planck data with regard to computational cost and cluster extrac- 
tion. 

The candidate sample (flux) completeness above a chosen flux 
threshold (see Figure[2} has been derived on the basis of the concor- 
dance model and Planck's instrumental properties. The complete- 
ness is no doubt very sensitive to the instrumental design defining 
the instrumental noise level, resolution and number of frequency 
channels. Planck's instrumental properties can be expected to be 
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well known. On the contrary, the noise level due to cosmological 
contaminants depends like the cluster abundance on the cosmolog- 
ical model of our Universe. While the dependence of the primor- 
dial CMB on cosmological parameters is well understood, there 
is still little known about the cosmological dependence of number 
counts, auto-/cross-correlations and evolutions of point source pop- 
ulations at Planck's observing frequencies. Hence, we modelled the 
point source count empirically on the basis of recent observations. 
The primordial CMB power spectrum in our simulations has been 
chosen so that its shape agrees with observational findings from 
WMAP, VSA and other experiments. Furthermore, we studied the 
impact of variations of the cluster (surface) number density with 
cosmology on the completeness. Our testing shows that the com- 
pleteness is fairly insensitive to such variations at least for reason- 
able changes of cosmological parameters. This insensitivity can be 
explained by the in comparison to angular sizes of SZ sources fairly 
large Planck channel beams which smooth fluctuations in the SZ 
background. In the case that any significant discrepancies between 
our assumptions and future observations arise, the algorithms can 
be rerun on updated simulations to adjust the completeness esti- 
mates. Moreover, the absolute number of clusters which will be 
detectable by Planck depends on several factors. Apart from the 
cosmology of the Universe which heavily influences cluster detec- 
tion numbers, also c luster physics plays an important role (see e.g. 
Ida Silva et al]fe004h ). 



6 EXPECTED CONSTRAINTS ON COSMOLOGY FROM 
THE PLANCK SZ SURVEY 

It is well known and one of the major motivations of blank field SZ 
cluster surveys that the cluster abundance and redshift distribution 
is sensitive to cosmological parameters. On the basis of the 'blind' 
cluster extraction algorithm pursued above and its estimated clus- 
ter flux selection (the sample completeness at flux Y and redshift 
z, <if(Y,z) = N det (Y,z)/N exp (Y,z)) and purity, in the following the 
constraints one can obtain from a Planck SZ survey on cosmologi- 
cal parameters are investigated. 

6.1 Analysis methodology 

The cluster flux dependent selection is found to be approximately 
universal at redshifts z > 0. 1 for the implemented SZ extraction 
algorithm. The effect of approximately constant flux sensitivity at 
redshifts above z fs 0.1 is essentially due to the vast majority of 
clusters at these redshifts being unresolved. A strong exception to 
this universality occurs only at very low redshifts. Below z < 0.05 
the limiting flux at which the sample has a specific constant com- 
pleteness increases rapidly to higher fluxes with decreasing red- 
shift. However, since the affected volume is small (low redshift), 
the number of clusters missed is marginal so that the completeness 
above a limiting flux at redshift z (z > 0.1) does not differ from 
the redshift integrated one as shown in Figure|2]by a large margin. 
Moreover, all clusters at such low redshifts (z < 0.05) and with SZ 
fluxes comparable to those in the Planck sample should be (easily) 
detectable by other observational means. For example, they should 
have been detected by the ROSAT All Sky Survey (RASS). 

For the comparison of the theoretical predictions of the fidu- 
cial models with the ones of other models and subsequently for 
estimating the constraining power of the cluster sample at hand on 
cosmological parameters, we use a MCMC analysis. Our analy- 
sis is based on the Metropolis algorithm (see iMetropolis & Ulaml 



GUI)) to sample the (log-)likelihood function over the parameter 
space. This represents an efficient way of sampling. For the analy- 
sis, our basic parameter set consists of four cosmological parame- 
ters, Q m , Q.^, c 8 and h. Note that we do not assume that the Uni- 
verse has a flat geometr y, as it has been often do ne in recent works 
of other authors (see e.g. lBattve & Welleil d2003l) who put their em- 
phasis on constraining the 'nature' of dark energy with SZ cluster 
surveys). Other parameters which generally can be varied as well 
are kept fixed. For example, the spectral index is fixed to n s = 1 
(Harrison-Zel'dovich spectrum) and the baryon density Sly is set 
to the best fit WMAP value. Furthermore, our presented analysis 
is restricted to ACDM models (w = —1). The inability of cluster 
surveys on their own to constrain the Hubble parameter, causes us 
to place in the course of our analysis tight constraints on h obtained 
by other means, i.e. constraints from the Hubble Space Telescope 
Key Project (Gaussian prior: h = 0.7 ± 0.08). Nevertheless, we first 
examine the effect of a loose uniform h prior on constraints of other 
parameters. The parameter space spanned by the other cosmologi- 
cal parameters (fl m , £2a an d o"g) is uniformly sampled in our anal- 
ysis. 

Moreover, in the presented work, running a self-calibration 
analysis, in which apart from cosmological parameters cluster 
physics parameters are varied as well, has not been attempted. 
The normalisation of the mass-Comptonisation parameter (mass- 
observable) relation of clusters has been assumed to be a priori 
known and it has been assumed to not evolve with redshift. How- 
ever, due to the large number of clusters, which is of the order of 
10 3 for a detection significance of 5a and probable cosmologies 
of the Universe, such a self-calibration analysis is feasable with- 
out abandoning completely meaningful constraints on parameters. 
Since there is a rather large error on the reconstructed cluster fluxes, 
only total cluster numbers on the full sky and within chosen redshift 
bins are used here to derive parameter constraints. Investigating and 
understanding reconstructed cluster fluxes and deducing a reliable 
relation between them and real cluster fluxes might allow one to 
derive even tighter constraints on parameters and eases the grounds 
for a self-calibration analysis. 

On the basis of the cluster flux selection function and the 
flux to mass conversion, the mean expected cluster number of the 
fiducial models is compared with theoretical predictions of mod- 
els by using a Poisson-averaged likelihood in our MCMC analysis 
(also accounting for the (small) cluster candidate sample impurity 
which otherwise slightly biases the cosmological parameter con- 
straintsQ). Here, we want to emphasize that the selection functions 
(depending on cluster flux and redshift) for the performed extrac- 
tion algorithms are more complex than it has often been assumed 
in previous analyses of other authors. Most often simple step and 
symmetric selection functions have been applied in those studies. 
In the following, the found two dimensional (redshift and flux) se- 
lection of the MFMF cluster extraction method is used to derive 
cosmological parameter constraints. Note further that the choice of 
the parameterised cluster template can affect the cluster selection 
and as well bias cluster photometry and thus cosmological con- 
straints. A discrepancy of the presumed template from the average 
universal one of real clusters causes a reduction in the cluster detec- 



Note that in addition to predictions about the sample purity gained from 
simulations as described in this work, false detections will be exposed by 
follow-up observations, which need to be carried out to estimate cluster 
redshifts. Nevertheless, in order not to waste valuable observing time, a 
high purity of the chosen candidate sample is absolutely necessary. 
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tion efficiency. Generally, in addition to ignored sources of confu- 
sion, the selection function is mostly affected by the choice of the 
template and its parameterisation. In the case of spatially highly 
resolved multi-frequency data, a high adaptability of the parame- 
terised template is advantageous. Here, the same cluster template 
has been used for the SZ simulations and as the detection template. 
The detection efficiency is thus optimal. In reality, it will be an it- 
erative process to match the parameterised template and its param- 
eter priors to the profiles of real clusters. It would be a good test 
of the cluster extraction algorithm to apply it to Planck data whose 
SZ component has been realised by hydrodynamical N-body sim- 
ulations. However, at present such simulations of cosmic volumes 
and in quantities as needed to give robust predictions of the cluster 
selection for the Planck cluster survey are not available. 

Furthermore, the conversion used between the real cluster flux 
and the cluster mass is taken to be universal and assumed to be free 
of any dispersion. In principle there exists intrinsic scatter in the 
mass-flux relation due to differences in cluster environments and 
evolution histories. It is possible to include such an uncertainty in 
the relation by a convolution of the mass function. However, since 
hydrodynamical cluster simulations support the assumption that a 
tight correlation between the cluster m ass and flux exists (see e.g. 
Ida Silva et ail d2004h : lMotl et al.l d2005h ). we neither include a dis- 
persion of the scaling relation in the SZ simulations nor in the fol- 
lowing analysis of cosmological parameter constraints. Estimates 
of the scatter intrinsic to the mass-flux relation obtained from nu- 
merical cluster simulations determine it to be of the order of a few 
percent. In the case one wants to make use of the SZ cluster flux 
function (log 5 — log AO by binning clusters according to their re- 
constructed fluxes, it is in general not the intrinsic scatter in the 
mass-flux relation which causes the largest uncertainty of the num- 
ber of clusters contained in a flux bin. It is rather the scatter of 
the reconstructed fluxes to the real cluster fluxes that prevents one 
from finely flux binning the recovered sample. As it can be seen 
from the flux scatter plots shown in Figure[3]and Figures 9, 10 and 
12 of GKH05, which are comparable to the one of the algorithm 
employed in this work, the uncertainty in the relation of the recon- 
structed to the real cluster flux exceeds by far for most flux limits 
except for the highest flux clusters on the sky the intrinsic scatter 
of the mass-flux relation of individual clusters. 

6.2 Results 

In the following results on parameter constraints are given for sev- 
eral assumptions concerning the restrictions placed on parameters 
by priors (notably restrictions on the Hubble parameter, h) and for 
different degrees of effort of following-up the cluster sample in the 
optical for providing cluster redshifts (actually we distinguish be- 
tween no and complete follow-up). In doing so, we start from weak 
assumptions and assume an increase in knowledge about the sam- 
ple redshifts as well as tighten the prior on h. These actions lead, 
as expected, to ever tighter constraints on cosmological parameters 
obtainable from the cluster sample. 

Due to the redshift independence of the SZ effect and due to 
the resolution of the Planck channels, cluster redshift information 
cannot be gained from Planck data on its own. In the case of high 
resolution SZ observations, red shifts can be esti mated based on 
morphological observables (see lDiego et al.l J2003T) ). However, the 
error bars given by the morphological redshift estimation are large. 
Still such redshift determination might be useful for future SZ sur- 
veys, such as the SPT, which will presumably detect thousands of 
clusters. Obtaining redshifts for such a large number of clusters 




Figure 3. Scatter of the reconstructed cluster fluxes (F recon ) versus the real 
cluster fluxes C^input) of cluster candidates which are matched up with a 
simulation input cluster by the described matching algorithm. The scatter is 
shown for fluxes above F cllt = 1 x 10~ 3 arcmin 2 . 



by follow-up observations at different wavelengths represents cur- 
rently a major observational effort. Therefore, we first examine the 
case that no cluster redshift information is available. Figure|4]shows 
the 'constraints' that can be obtained from the total number of SZ 
detected clusters for Q.,„ and rjg. As expected the two parameters 
are completely degenerate. One can always find a combination of 
these two parameters, which mainly govern the mass function, that 
reproduces the observed total cluster number on the full sky. The 
particular shape of the degeneracy depends on the survey layout, 
e.g. the survey depth that can be reached. The degeneracy between 
£l m and a% can be broken so mewhat by utilising an gular correla- 
tion function information (see lMei & Bartlettl d2004h ). The 'width' 
of the degenerate constraints at a given (fixed) value of fl m or Os 
respectively depends strongly on the range of the h prior. Here we 
used a uniform prior on h with 0.4 ^ h ^ 1. Another way to break 
the degeneracy between Q. m and a$ and to constrain h at the same 
time is to carry out a combined data analysis utilising the Planck 
cluster sample and the primordial CMB power spectrum (or tem- 
perature and polarisation power spectra) obtainable from Planck 
data. 

In our first analysis only the ACDM concordance model 
(£!,„ = 0.3, £2a = 0.7, = 0.9 and h = 0.7) has been considered as 
fiducial model. Since the recently published results of the (primor- 
dial) cosmic microwave background analysis of the WMAP three 
year data prefer different parameters than the concordance model, 
we include apart from the concordance model the best fit WMAP 
cosmology (£l m = 0.27, = 0.73, 0s = 0.75 and h = 0.7) in our 
examinations and compare the constraints on the parameters of the 
two models. In particular, the change in has an influence on the 
expected number of clusters recoverable from Planck data. 

In order to estimate by how much constraints on cosmologi- 
cal parameters are improved by provided cluster redshift informa- 
tion, we further (optimistically) assume that cluster redshifts are 
known within some error for the complete Planck cluster sample. 
This raises questions about the feasability of obtaining cluster red- 
shifts for a major fraction or for the entire sample respectively. The 
presently least expensive way to measure cluster redshifts is by 
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Figure 4. Cosmological parameter constraints on Q.,,, and Cg from total cluster number counts on the full sky and without cluster redshift information at hand. 
The 68.3% (black solid line), 95.4% (red dashed line) and 99.7% (green dash-dotted line) confidence levels are shown. The two parameters are degenerate for 
total number counts. Without prior knowledge by other means only the shape of the degeneracy relation can be constrained. A liberal top-hat prior is placed 
on the Hubble constant (0.4 h $J 1). As fiducial cosmological model the concordance model has been assumed (Q.,„ = 0.3, Q.\ = 0.7, 0g = 0.9 and h = 0.7). 



performing multi-band near-IR and optical imaging observations. 
First, let us discuss how likely it will be at the time when Planck 
will have collected its data to have access to optical data for redshift 
estimations as needed here. With surveys, such as the Sloan Digital 
Sky Survey (SDSS0 and the Two-degree Field Galaxy Redshift 
Survey (2dFGRSo being already in place and even larger surveys 
being funded and becoming operational at about the time Planck 
finishes data collection (e.g. Pan-STARRS, expected to start scan- 
ning the sky in 2010), it is not too far-fetched to assume that for a 
major fraction of the sky optical data of high quality will be avail- 
able. This assumption of having redshift information available for 
a large number of Planck clusters is further supported by the fact 
that the median redshift of the Planck cluster sample of z ~ 0.2 
matches reasonably well the median redshifts of today's large scale 
galaxy surveys (i.e. the galaxy samples of the SDSS have median 
redshifts of z ~ 0. 104 (main galaxy sample) and z ~ 0.35 (luminous 
red galaxies) and that of the 2dFGRS is z w 0.11). 

Furthermore, as the Planck sample contains only (very) mas- 
sive clusters (commonly M c \ >5x 10 14 A^'Mq), clusters of the 
sample will have a high richness (number of member galaxies) and 
contain many bright galaxies. Further, since the massive clusters 
of the sample should exhibit a strongly developed red-sequence of 
galaxies, a deep two-band photometry might be an economical way 
to gain cluster redshifts of the precision needed for clusters located 
on the sky remotely from covered optical survey areas. Neverthe- 
less, due to the large uncertainties of the cluster position caused 
by the rather coarse Planck channel resolutions, pointed follow- 
up observations of single clusters might be a difficult and cum- 
bersome undertaking. This complicates pointed follow-up observa- 
tions in optical as well as X-ray wavebands (for a detailed discus- 
sion on Planck cluster sample follow-up at different wavelengths 
and on the expected p roperties of clusters detectable by Planck in 
other wavebands see lwhiteU2003h ). Even in the case of large sur- 
veys at optical and/or X-ray wavebands, the recovered positions 

18 http://www.sdss.org 

19 http://www.mso . anu.edu.au/2dFGRS 



of Planck detected clusters give only weak constraints for locat- 
ing associated cluster characteristic features in data collected over 
wide fields at these wavebands. A similar procedure of matching 
up Planck detected clusters with detections at other wavebands, as 
described in section IBT21 has rather to be adopted, after cluster can- 
didates have been located within a matching region at the respective 
other waveband under c onsideration. For the case of finding X-ray 
counterparts, the RASS jTriimperl Jl99l|) : IVoges~eI al. ( 1999)) is a 
good base for providing matches for Planck clusters with redshifts 
of z < 0.3. The main existing X-ray instruments used these days 
to detect clusters, XMM-Newton and Chandra, may be not opera- 
tional anymore at the time when Planck completes its data collec- 
tion. However, there should be a large overlap between a combined 
catalogue of cluster detections made by them and the future Planck 
sample. 

Since the Planck cluster sample consists mainly of low red- 
shift massive clusters (the Planck sample is unlikely to contain de- 
tections with redshifts z > 1 for the fiducial cosmological models), 
already quite shallow multi-band optical surveys covering a wide 
field (as the ones mentioned above) are well sufficient for cluster 
redshift determination. Having data of up-coming surveys, such as 
Pan-STARRS and the Large Synoptic Telescope, available eases 
the redshift hunt even further. In the following, we use a conserva- 
tive redshift binning of Az = 0. 1 to group clusters in redshift bins. 
Ideally one would like to have access to spectroscopic redshifts 
whose la precision is commonly better than Az = 0.01 for an indi- 
vidual galaxy in the redshift range of interest. However, availability 
of spectroscopic redshifts is likely to be limited. Nevertheless, pho- 
tomotric redshifts suffice as well for our purposes. For example, 
photometric redshifts derived from the five SDSS bands are accu- 
rate to Az ~ 0.03 for an individual galaxy. For a massive cluster 
hosting several detectable galaxies (N z . d \) the photometrically deter- 
mined redshift estimate precision increases proportional to ^J~N^\■ 
Even for a reasonably deep two-band photometry which uses the 
4000 angstrom break of the red cluster sequence galaxies, cluster 
redshift estimates obtained from colour-magnitude diagrams have 
uncertainties below Az ~ 0.05 for most of the considered redshift 
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Figure 5. Cosmological parameter constraints on Q.,„ and 08 from the full sky Planck survey cluster sample. 



range. Therefore, with cluster redshift determination on photomet- 
ric grounds, redshift bins of Az = 0.1 are well feasable. Further- 
more, our presumed choice of redshift binning is optimal in this 
respect that it avoids significant cross-correlations between seper- 
ate adjacent bins. Covariances between them can therefore be ne- 
glected in the analysis. 

At first, we derive constraints assuming a very weak prior on 
the Hubble parameter: 0.02 h 5 (actually this corresponds to 
h being unconstrained). The resulting obtainable constraints on £l„, 
and a% are shown in Figure[5]for the two fiducial cosmologies with- 
out requiring the geometry of the Universe to be flat. Since the 
detected cluster number is statistically relevant, due to the lower 
number of detected clusters in models with low values of rjg, as 
expected, the constraints derived for the WMAP fiducial model are 
weaker than the ones on the concordance model parameters. While 
the analysis is able to place reasonable constraints on rjg for both 
fiducial models (Arjg < 0.08 at all times), it is barely feasable to 
gain useful restraining information about the matter density £l m . 
However, an Einstein-de Sitter model (fl m = 1) can be excluded 
at high significance in both cases. Note that on the basis of the 
performed analysis the two fiducial models exclude each other at 
several (^> 3) a. 

Finally, we place a tight prior on h. For the further analy- 
sis we constrain the Hubble para meter to h = 0.7 ± 0.08 , as sup- 
ported by the HST Key Project ( Free dman et ID fcOOlh ). Figure 
[6] and [7] show the marginalised one-dimensional likelihood distri- 
butions of the four cosmological parameters, which have been al- 
lowed to vary in our analysis, for the two fiducial cosmologies. In 
addition to the central expectation and fiducial parameter values, 
various confidence intervals are plotted as well. The dotted central 
vertical line in each panel indicates the fiducial parameter value. 
The central thick solid line gives in each case the estimated pa- 



rameter value gained from the MCMC analysis. Hereby, the shown 
parameter estimate corresponds to the median of the particular dis- 
tribution. The other vertical lines give the quantiles of the distri- 
butions that are used to quote confidence limits on the parameter 
constraints. In each case confidence interval lines indicated by dif- 
ferent colours and linestyles enclose 68.3% (black solid/dotted), 
95.4% (red dashed) and 99.7% (green dot-dashed) confidence re- 
gions. Next to the respective confidence interval lines the corre- 
sponding confidence level is given in the same colour. The thin 
lines correspond to quantiles which enclose a particular percent- 
age of the samples of the contributing chains by intersecting the 
likelihood distribution at the same 'height' on both sides of the dis- 
tribution peak. Thus the area under the graph outside the interval 
limits adds, for example, (asymmetrically for a non-Gaussian dis- 
tribution) up in total to 31.7% of the entire area under the graph 
in the case of the 68.3% confidence region. However, both sides 
do not have to contribute the same area. The thick lines represent 
confidence limits obtained from restricting in each case the inte- 
grated areas under the graph to be the same on both sides of the 
distribution peak. This means that for the la confidence limit in- 
terval, 15.85% of the samples of the chains have values below the 
lower confidence limit and the same percentage of sample values 
lie above the upper confidence limit. Note that there is no assump- 
tion about the distributions being Gaussian. The horizontal dashed 
blue lines indicate exp (— x 2 /2) for x = 1,2 and 3 respectively. In 
the case a probability distribution is Gaussian, the intersections of 
the distribution with the respective line correspond to the la, 2a 
and 3a confidence limits respectively. Therefore, if the vertical and 
horizontal lines do not cross each other at the point at which they 
intersect the distribution graph, the distribution is non-Gaussian. 
Moreover, for a Gaussian likelihood distribution the two different 
ways of placing confidence limits (thin and thick lines) agree. Be- 
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Figure 6. One-dimensional marginalised probability distributions for the cosmological parameters varied in our MCMC analysis. There has been no direct 
restriction on the curvature £2^ = 1 — Q.,„ — Q.\ (no flatness prior). Apart from the Hubble parameter, the remaining parameter sub-space of the other parameters 
is sampled uniformly. Note that redshift number counts of cluster surveys are most sensitive to Cg and in the case of a tight prior on h constrain Q.,„. The 
marginalised distributions are obtained from thinned MCMC sample chains. Thinning ensures that chain samples used for estimating parameter confidence 
intervals are de-correlated. Chains are thinned in such way that the correlation between consecutive chain steps in the effective thinned chains are below 0.5 
when defining the correlation to be unity at zero step size. All over several million samples are taken to explore the likelihood distribution. We refer the reader 
to the text for an explanation of the different confidence intervals shown in the panels. Note that, even though a large number of samples has been taken, the 
eiTor on the 99.7% confidence level of each parameter is rather large due to a still small number of samples located outside this confidence level and thus a 
resulting large relative sample variance. 



sides for the Hubble parameter, the marginalised distributions for 
the most part deviate from a Gaussian one. Even the likelihood dis- 
tribution of the Hubble parameter which is a priori restricted to a 
Gaussian is marginally screwed which is caused by degeneracies 
between h and other variable parameters in the analysis. 

The parameter expectation values (median of the particular 
distribution) gained from the MCMC likelihood analysis together 
with the 68.3% confidence limits are listed in TablesQ]and[2]for the 
fiducial models. By comparing the parameter constraints of Tables 
[T]and[2] it can be seen, parameters are tighter constrained (confi- 
dence intervals are smaller) in the case of the concordance model. 
This is due to the higher number of detectable clusters in the con- 
cordance model. 

Tight constraints are especially placed on the variance of mat- 
ter fluctuations on scales of 8/i~'Mpc, as. This is even the case for 
a less restrictive h prior. Under the made assumptions and set pri- 
ors (fixed n s and Q.f, and prior on h), Planck cluster redshift number 
counts surpass recent primordial CMB power spectrum measure- 



ments in the ability to constrain Og. However, loosening the restric- 
tions on the spectral index n s and the baryon density Q.j, weakens 
constraints on rjg . On the other hand primordial CMB power spec- 
trum evaluations are well suited to constrain n s , fl/, and h. Thus, a 
combined analysis of the Planck cluster sample and the primordial 
CMB power spectrum recovered from Planck CMB data is down- 
right recommended. Moreover, combining cluster number counts 
with investigations of angular (possibly spatial) clustering of the 
galaxy clusters in the sample and estimates of their gas (baryonic) 
mass fraction from multi-waveband observations may as well re- 
sult in a further improvement of constraints which are based on the 
Planck cluster sample and its follow-up observations. 

Furthermore, it is also feasable to derive tight constraints on 
the matter density parameter Q.„, if a restrictive h prior is set (see 
Tables Q] and [2}. The constraining power of the Planck cluster sam- 
ple on £l m is comparable to the one obtained from the three year 
WMAP data alone in an ana lysis with six fr ee parameters assuming 
the Universe to be flat (see lSpergel J2006l) ). Even the dark energy 
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content an d the curvature £1^ can be constrained. However, they 
are the ones which of all the parameters in our analysis are least 
constrained. The 68.3% confidence limits for Q.^ are given in Ta- 
blesQ]and[2] By adding fi,„ and Q.^ of each MCMC chain sample 
the scatter of the sample curvature values around flatness can be 
investigated for each fiducial (flat) model since we have not placed 
priors in our analysis on the geometry of the Universe. Hence, one 
obtains: £1* = 1- (o.999±g$Q and £l k = 1- (o.999+g;}?o) re- 
spectively. 

The one-dimensional constraints given in Figures[6]and|7]and 
in Tables Q] and [2] fail to reveal important information hidden in 
parameter correlations and degeneracies. In order to display degen- 
eracies between parameters, the two-dimensional joint likelihood 
distributions for all possible pairs of parameters are shown in Fig- 
ures[8]and|9]for our fiducial models. 

The panels of Figures [8] and [9] display well-known degen- 
eracies between parameters constrained by cluster redshift num- 
ber counts. For example, the Q. m — Og degeneracy has been found 
by many othe r authors performing a n analysis on either simu- 
lated (see e .g. Battve & Weller ( 2003)) or observed data (see e.g. 
iBahcall & Bodd d2003h ). Further, the shown correlation between 
Q. m and £2^ is expected. However, since many authors restrict their 
analyses to flat models, this degeneracy has been much less studied. 
The region of acceptable values of £l m and ensures that the evo- 
lution of the growth factor of linear perturbations is approximately 



Parameter 


Median (la constraint) 






n q+0.014 




uoy/ -0.065 


h 


0.7t° °l (prior) 



Table 1. Derived parameter estimates from the MCMC analysis (sample 
distribution medians) and 68.3% confidence regions with interval limits 
given by the 15.85% and 84.15% percentiles. The shown constraints are 
for the concordance ACDM model. 

comparable to the one of the fiducial cosmologies in the redshift 
range of interest. Large deviations in the evolution of linear pertur- 
bation growth affect the cluster mass function and thus the expected 
redshift cluster number count and its slope above a limiting mass 
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Parameter Median (la constraint) 



£l m 0.26911 



i2 A 0.736«;|3 



h 0-7±g;g| (prior) 



Table 2. The same as Table [T] This time giving the median and la con- 
straints on the cosmological parameters for an underlying cosmological 
model with WMAP best-fit parameters. 

significantly. Moreover, the comoving volume element and the lim- 
iting mass - both redshift dependent - as well show for the allowed 
parameter combinations (region of high confidence) over the red- 
shift range covered by the Planck cluster sample only moderate 
variations from the respective values of the fiducial models. From 
these two previous degeneracies one can predict the — Og one 
(see right panel in the second row of Figures[8]and[9}. The Hubble 
parameter h is degenerate to £l m and Q.\. Though, for the found 
cluster selection it shows little degeneracy with fjg. 

Note that evidence of the multi-dimensionality of the degen- 
eracies becomes noticeable by comparison of Figure [5] with the 
left panel in the second row (£2 m -cig-plane) of Figure [8] and Fig- 
ure [9] respectively. The tight prior on h carves out regions around 
the parameters of each fiducial model to which the constraints are 
confined. These regions in the two-dimensional parameter space 
are localised within the two-dimensional constraints plotted in Fig- 
ure [5] Shifting the mean of the Gaussian prior to a lower value of 
h shifts the region of high confidence further to the right on the 
£2,„-axis and marginally downwards on the ag-axis. The opposite 
happens for an increase of the mean value of the Gaussian Hubble 
parameter prior (while keeping the fiducial parameters unaltered). 
This illustrates the strong degeneracy between the matter density 
£i m and the Hubble parameter h. Therefore, constraining h leads to 
a strong enhancement of the constraint on the matter density and 
slightly improves the constraint on o"g. 



7 DISCUSSION AND CONCLUSIONS 

The sky simulation(s) and the modelling of the observing process 
of the Planck Surveyor satellite presented in this work are of high 
realism and are based on recent observational constraints and pre- 
dictions obtained from numerical simulations. Nevertheless, as dis- 
cussed previously, some uncertainties concerning the component 
modelling remain. These might even have more than a marginal 
influence on the results. For example, the modelling of the num- 
ber counts of IR/SM point sources and their spatial correlation to 
galaxy clusters is speculative since available observational data are 
sparse. Apart from a few small patch observ a tions undertaken by 
SCUBA and MAMBO (see e.g. iGreve et all fc004l) ). there is lit- 
tle known about the point source population at submillimetre and 
millimetre wavelengths, resulting in high sample variance and ap- 



parently hardly any insight into correlations. Moreover, some chan- 
nels of the Planck HFI (100 GHz, 150 GHz and 353 GHz) to which 
IR/SM sources contribute are highly valuable for SZ cluster de- 
tections. Therefore, a higher level of point source contamination 
at these frequencies and/or them being (strongly) spatially corre- 
lated with clusters could affect Planck cluster number counts. Fur- 
thermore, cluster internal physical processes, such as AGN or SN 
cluster gas heating, may contribute to the cluster SZ signal in ad- 
dition to gravitational processes. So far, the mechanisms of these 
processes occuring at late cluster evolution stages are not well un- 
derstood. However, the Planck cluster sample should provide an 
extensive basis for studying such cluster physics. 

To the simulated data we have applied a cluster extraction al- 
gorithm. The method is a multi-frequency matched filtering tech- 
nique. It is based on a variational cluster template whose parame- 
ters are discretely varied. For this algorithm we optimised the pa- 
rameter discretisation with respect to algorithm performance and 
computing cost. Contrary to past analyses which have often re- 
stricted the template to be rigid (e.g. Gaussian beam shaped under 
the assumption that sources are unresolved), the priors on the tem- 
plate parameters have been chosen in such a way that they yield 
an optimisation of the cluster detection efficiency for the expected 
quality of the Planck data and cluster physical sizes. The recov- 
ered cluster catalogue is then constructed from candidates whose 
detection significance exceeds 5c. The built cluster catalogue has 
been found to be suitable for cosmological considerations under 
the condition that the survey selection for the data and the adopted 
algorithm is well understood. For a suitable parameterised se- 
lection function, a 'self-calibration' is as well feasable due to the 
large sample size. The extracted catalogue consist of clusters at 
moderate and intermediate redshifts with cluster masses generally 
°f M c \ > 5 x lO 14 A -1 Af0. The contamination of the catalogue is 
found to be fairly low. In general, an expected low sample con- 
tamination is a prerequisite in order to be able to use reliably the 
statistical power of such a large catalogue in addition to a compre- 
hensive sample completeness. 

Furthermore, comparing this work with the work presented in 
GKH05, it is found that the cluster selection is in good agreement 
with the one found in our previous paper (even though cluster ex- 
traction methods differ). In GKH05, due to the strict matching ac- 
ceptance region of ~ 2 arcmins, a certain number of clusters (< 10 
per cent above a flux limit of 2 x 10~ 3 arcmin 2 ) are not matched 
up correctly. These missed matches reduce the sample complete- 
ness estimate and increase on the other hand the contamination es- 
timate by approximately the same percentage. However, our previ- 
ous work aimed to give conservative estimates and reliable limits 
which should be definitely achievable by the Planck cluster sam- 
ple. On the contrary, overly large matching acceptance regions lead 
to over-estimates of the completeness and purity above survey flux 
detection thresholds. This is especially the case for low flux limits 
and high cluster surface densities, for which overly large accep- 
tance regions increase highly the possibility of finding a match just 
by chance. 

The precision of the photometric cluster parameters of the 
Planck samples recovered by the applied algorithms is (on the ba- 
sis of our simulations) expected to be rather poor. This is simi- 
lar to what we found in GKH05 using a different cluster recov- 
ery pipeline. Here, the (relative) dispersion (o(logF)) of the recon- 
structed cluster fluxes around their real values is estimated to be 



This assumption has been made throughout this work. 
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Figure 8. Two-dimensional confidence contours for all pairs of parameters. To obtain the confidence regions we marginalise in each case over the respective 
other parameters. In each case the contours enclose the 68.3% (black solid line), 95.4% (red dashed line) and the 99.7% (green dot-dashed line) confidence 
regions. The underlying fiducial model is the concordence ACDM model. 



approximately 15 percent for the whole sample on average for the 
recovery by the MFMF method. Though, it is found that the pho- 
tometric accuracy improves with increasing sample flux threshold. 
Nevertheless, we have not made use of recovered photometric clus- 
ter properties (namely the recovered cluster fluxes) in our parame- 
ter analysis. The found large dispersion reduces the usefulness of 
the recovered cluster fluxes for survey 'self-calibration' and cosmo- 
logical parameter constraints via accurate theoretical mass function 
predictions, a parameterised mass observable relation and selec- 



tion function. Likewise, the low photometric quality of the Planck 
cluster sample affects its cluster physical interpretation in the exact 
same manner. Therefore, we only touched briefly aspects of late 
cluster physics in our discussion above. Only global trends, such as 
the overall (average) normalisation of the M — Y scaling relation, 
may be grasped by the Planck sample (see GKH05). Constraints 
on the scaling relation normalisation, however, are expected to be 
degenerate with cosmological parameters, such as Og. One way to 
improve the understanding of cluster physics is to follow up the 
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Figure 9. The same as Figure[8]for the WMAP best fit fiducial cosmological parameter model (Q. m = 0.27, £1 A = 0.73, Os = 0.75 and h = 0.7). 



sample clusters with observations in the optical and X-ray wave- 
bands. As discussed in section [6] this is a rather cumbersome un- 
dertaking. It further has to be pointed out that a template choice 
differing from the actual universal one biases the photometric pa- 
rameter estimates (on average) in addition to the large dispersion. 
However, at the time Planck completes its data collection, several 
'small scale' SZ experiments will have finished their scientific pro- 
grammes and obtained results which will give insights into cluster 
physical aspects, such as cluster profiles, the normalisation of the 
M — Y relation and its evolution and intrinsic scatter. Therefore, 
our assumption of available prior information is realistic. These ex- 



periments can also be used to follow up the Planck sample in the 
microwave band enhancing resolution and cluster flux estimations. 

Besides optimising cluster extraction and forecasting cluster 
selection of two powerful algorithms applied to simulated Planck 
data, we focus on the cosmological prospects which can be accom- 
plished by the Planck cluster sample. This is the first time that 
based on a realistic selection function derived from astrophysical 
observation simulations and an implemented data analysis pipeline 
the cosmological use of the future Planck cluster survey is eval- 
uated. In our MCMC analysis to constrain cosmological param- 
eters, we assume a priori knowledge about the mass observable 
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relation and the cluster selection. As pointed out in the previous 
paragraph, limits on the M — Y relation normalisation will be ob- 
tainable before long by up-comming SZ cluster survey instruments 
and follow-up of their observations. Insights into the cluster selec- 
tion function can be achieved by mock observations as presented 
in this work. However, neglecting or misestimating the magnitude 
of contaminants leads inevitably to a bias in the expected selec- 
tion and therefore in the best-fit cosmological parameters obtained 
from the sample. Note that even in the case of a so-called 'self- 
calibration' analysis some advanced fixings of the parameterised 
(functional) shape of the selection have to be made. For example, 
a very common assumption is that the scatter about the mean lim- 
iting mass or value of the observable respectively is of Gaussian 
nature^ Apart from mock simulations as performed in this work 
to estimate the selection function, cross-checks of simulations with 
data and direct contaminant extraction from data are essential to 
investigate cluster selection. Reliably pinning down the selection 
function of the sample will be an iterative process in which results 
from mock simulations will have to be adjusted to observations in 
order to make them converge. A number of methods are available to 
seperate spatially and spectrally diverse components and thus help 
with estimating the flux selection of clusters. Powerful algorithms 
for disentangling components are, for example, Independent Com- 
ponent Analysis (ICA) and Maximum Entropy methods (MEM). 
They may also be applied in order to reduce contamination. How- 
ever, it is as well possible to estimate the confusion of the cluster 
fluxes by the MFMF cluster extraction algorithm itself by allowing 
the level of contamination to vary and including it in the parameter 
optimisation. 

Before summarising the results on constraints, some further 
challenges the parameter estimation will have to face are outlined. 
Firstly, uncertainties in cluster physics as well as in cosmological 
parameters and foregrounds (e.g. IR/SM point sources) suggest that 
the cluster number contained in the future Planck sample may be 
uncertain up to a factor of 2-3. The number of recovered clusters 
evidently controls the statistical ability of the sample to tighten con- 
fidence limits on the best-fit parameters. Hence, in section[6]the im- 
pact of different the Universe possibly underlying cosmologies on 
the ability to derive tight cosmological parameter constraints from 
the respective Planck cluster sample has been investigated. The two 
different fiducial cosmologies utilised are the concordance ACDM 
and the best-fit ACDM WMAP model. They differ mainly in their 
value of Gg, a parameter which is still fairly little constrained (by 
today's standards). Unsurprisingly, the sample with the fewer clus- 
ter members (lower value of Og) places the weaker constraints on 
the parameters (about a factor of 1.8 for the rjg constraint; see Ta- 
bles Q] and [2] and Figures [6] and |7J. Another concern which is not 
linked (directly) to the recovery process is the level of accuracy in 
the theoretical prediction of the cluster mass function. Compari- 
son of large scale numerical (N-body) simulations and halo finding 
codes estimate the current theoretical uncertainty to be at a level of 
approximately 10 percent. 

Disregarding the theoretical mass function uncertainty, our 
forecasts suggest that the Planck cluster sample will be able to put 
tight constraints on cosmological parameters additional to the ones 
derived from the primordial CMB power spectrum recovered from 
Planck data. Despite the sample's rather 'low' expected mean red- 



This assumes that the intrinsic scatter in the mass-observable relation 
and the (in quadrature) additive (extra) scatter in the reconstructed cluster 
fluxes caused by contamination are Gaussian. 



shift, due to the full sky coverage of the survey, cosmological pa- 
rameter constraints of similar quality as gained from current pri- 
mordial CMB analyses are realisable. The Planck cluster survey 
will especially have the capability to tighten constraints on fjg. The 
matter variance on scales of 8 /i~'Mpc is a parameter that can be 
only 'weakly' (in comparison to other parameters) constrained by 
primordial CMB measurements. All over, current primordial CMB 
observations on their own are not best suited to place constraints 
on the shape and normalisation of the matter power spectrum. The 
large difference of the best-fit Og value of the first and three year 
WMAP data (which show an approximately 2a discrepancy) indi- 
cates that the current primordial CMB parameter constraint should 
possibly not be taken too literally. This notion is strengthened by 
the fact that several other experiments obtained values discrepant to 
the WMAP constraint. Large scale structure observations of galax- 
ies and clusters are more suited to place tight constraints on the 
shape of the matter power spectrum and Og. However, most of the 
large scale structure surveys carried out up to the present day are 
of such small scale that they are heavily affected by sample vari- 
ance and ignored systematics may play a role as well. The Planck 
cluster sample can overcome these problems. Furthermore, degen- 
eracies between parameters, such as Sl m and obtained from pri- 
mordial CMB measurements and cluster number counts are differ- 
ent. Therefore performing a combined data analysis helps to narrow 
down regions of high likelihood in parameter space and to break the 
parameter degeneracies. Further, we find that while the constraint 
on Og is only weakly dependent on a prior on the Hubble parame- 
ter, constraints on the matter and dark energy density, Q. m and Q.^, 
depend strongly on it. By including a reasonable prior on h in the 
analysis as given, for example, by the HST Key Project, the con- 
fidence intervals of these parameters shrink by up to an order of 
magnitude. Nevertheless, even without such h prior, the bare exis- 
tence of a cosmological constant can be confirmed well above the 
3o confidence level. 

In conclusion, the gain of the Planck cluster catalogue will be 
twofold. Firstly, it will be a fruitful sample to serve as a base for 
studying cluster physics (the normalisation, evolution and intrinsic 
scatter of cluster properties and their scaling relations) in combi- 
nation with large scale surveys at other wavelengths (RASS, SDSS 
etc.) and/or follow-up in the microwave band. Secondly, our inves- 
tigations suggest that the Planck cluster sample (recoverable from 
future Planck data by algorithms like the one described above) can 
live up to the high expectations predicted from pure theoretical and 
analytical estimations by placing meaningful constraints on cos- 
mological parameters. Overall, an all-sky sample of massive clus- 
ters with a well understood selection function as achievable by the 
Planck mission will be of great value for cluster research and cos- 
mology. In a forthcoming paper we will investigate more sophis- 
ticated methods than MFMF to examine how their performances 
improve cluster extraction and constraints on cosmological param- 
eters. 
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